Picture for Julia Kempe

Julia Kempe

Formalizing Mathematics at Scale

Add code
May 28, 2026
Viaarxiv icon

Efficient RL Training for LLMs with Experience Replay

Add code
Apr 09, 2026
Viaarxiv icon

Likelihood-Based Reward Designs for General LLM Reasoning

Add code
Feb 03, 2026
Viaarxiv icon

Teaching Models to Teach Themselves: Reasoning at the Edge of Learnability

Add code
Jan 26, 2026
Viaarxiv icon

Outcome-based Exploration for LLM Reasoning

Add code
Sep 08, 2025
Viaarxiv icon

Tuning without Peeking: Provable Privacy and Generalization Bounds for LLM Post-Training

Add code
Jul 02, 2025
Viaarxiv icon

PILAF: Optimal Human Preference Sampling for Reward Modeling

Add code
Feb 06, 2025
Figure 1 for PILAF: Optimal Human Preference Sampling for Reward Modeling
Figure 2 for PILAF: Optimal Human Preference Sampling for Reward Modeling
Figure 3 for PILAF: Optimal Human Preference Sampling for Reward Modeling
Figure 4 for PILAF: Optimal Human Preference Sampling for Reward Modeling
Viaarxiv icon

Flavors of Margin: Implicit Bias of Steepest Descent in Homogeneous Neural Networks

Add code
Oct 29, 2024
Viaarxiv icon

On the Geometry of Regularization in Adversarial Training: High-Dimensional Asymptotics and Generalization Bounds

Add code
Oct 21, 2024
Figure 1 for On the Geometry of Regularization in Adversarial Training: High-Dimensional Asymptotics and Generalization Bounds
Figure 2 for On the Geometry of Regularization in Adversarial Training: High-Dimensional Asymptotics and Generalization Bounds
Figure 3 for On the Geometry of Regularization in Adversarial Training: High-Dimensional Asymptotics and Generalization Bounds
Figure 4 for On the Geometry of Regularization in Adversarial Training: High-Dimensional Asymptotics and Generalization Bounds
Viaarxiv icon

Emergent properties with repeated examples

Add code
Oct 09, 2024
Figure 1 for Emergent properties with repeated examples
Figure 2 for Emergent properties with repeated examples
Figure 3 for Emergent properties with repeated examples
Figure 4 for Emergent properties with repeated examples
Viaarxiv icon